Tree-Based Policy Learning in Continuous Domains through Teaching by Demonstration

نویسندگان

  • Sonia Chernova
  • Manuela Veloso
چکیده

This paper addresses the problem of reinforcement learning in continuous domains through teaching by demonstration. Our approach is based on the Continuous U-Tree algorithm, which generates a tree-based discretization of a continuous state space while applying general reinforcement learning techniques. We introduce a method for generating a preliminary state discretization and policy from expert demonstration in the form of a decision tree. This discretization is used to bootstrap the Continuous U-Tree algorithm and guide the autonomous learning process. In our experiments, we show how a small number of demonstration trials provided by an expert can significantly reduce the number of trials required to learn an optimal policy, resulting in a significant improvement in both learning efficiency and state space size.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Confidence-Based Robot Policy Learning from Demonstration

The problem of learning a policy, a task representation mapping from world states to actions, lies at the heart of many robotic applications. One approach to acquiring a task policy is learning from demonstration, an interactive technique in which a robot learns a policy based on example state to action mappings provided by a human teacher. This thesis introduces Confidence-Based Autonomy, a mi...

متن کامل

The Effect of Teaching through Demonstration on Midwifery Student's Self-efficacy in Delivery Management

Introduction: Active learning methods are becoming increasingly popular in midwifery students education. So the aim of this study was to determine the effect of education using demonstration on midwifery student's self-efficacy in delivery managementMethods: This quasi-experimental study was performed in 2013 in Isfahan University of Medical Sciences. Thirty midwifery students were selected thr...

متن کامل

Comparison of Video-Based Instruction and Instructor Demonstration on Learning of Practical Skills in Nursing Students

Introduction: Since technology has an important role in the improvement of educational quality, finding better methods of teaching and learning and improving equipment and teaching materials is emphasized. Regarding this, two educational methods- presentation by the instructor and video presentation, were offered and their effectiveness on nursing students’ learning skills was compared. Method...

متن کامل

Constructing Skill Trees for Reinforcement Learning Agents from Demonstration Trajectories

We introduce CST, an algorithm for constructing skill trees from demonstration trajectories in continuous reinforcement learning domains. CST uses a changepoint detection method to segment each trajectory into a skill chain by detecting a change of appropriate abstraction, or that a segment is too complex to model as a single skill. The skill chains from each trajectory are then merged to form ...

متن کامل

Confidence-Based Multi-Robot Learning from Demonstration

Learning from demonstration algorithms enable a robot to learn a new policy based on demonstrations provided by a teacher. In this article, we explore a novel research direction, multi-robot learning from demonstration, which extends demonstration based learning methods to collaborative multi-robot domains. Specifically, we study the problem of enabling a single person to teach individual polic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006